Generation of robust phonetic set and decision tree for Mandarin using chi-square testing
Abstract
A phonetic representation of a language is used to describe the corresponding pronunciation and synthesize the acoustic model of any vocabulary. A phonetic representation with smaller phonetic units such as SAMPA-C for Mandarin Chinese and decision trees for parameter sharing are broadly applied to deal with the problem of large numbers of recognition units. However, the confusable phonetic representation in SAMPA-C generally degrades the recognition performance. In this paper, a statistical method based on chi-square testing is used to investigate the phonetic unit characteristics that are confusing and develop a more reliable phonetic set, named modified SAMPA-C. A corresponding question set for the modified SAMPA-C and a two-level splitting criterion are also proposed to effectively and efficiently construct the decision trees. Experiments using continuous Mandarin telephone speech recognition were conducted. Experimental results show that an encouraging improvement in recognition performance can be obtained. The proposed approaches represent a good compromise between the demands of accurate acoustic modeling and the limitations imposed by insufficient training data.
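The abstract does not give the exact statistic or decision rule used, so the following is only a minimal sketch of how a chi-square test could flag a pair of phonetic units as confusable. It assumes a generic Pearson chi-square test of homogeneity on recognition-output counts; the function names `chi_square_statistic` and `is_confusable`, the count vectors, and the rule "confusable when the test fails to separate the two distributions" are all illustrative assumptions, not the authors' method.

```python
from typing import Sequence


def chi_square_statistic(table: Sequence[Sequence[float]]) -> float:
    """Pearson chi-square statistic for an R x C contingency table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under the hypothesis that all rows share one distribution.
            expected = row_totals[i] * col_totals[j] / grand_total
            if expected > 0:
                stat += (observed - expected) ** 2 / expected
    return stat


def is_confusable(counts_a: Sequence[float],
                  counts_b: Sequence[float],
                  critical_value: float) -> bool:
    """Assumed rule: two units are confusable when the test cannot
    distinguish their recognition-output distributions."""
    return chi_square_statistic([counts_a, counts_b]) < critical_value


# Hypothetical counts: how often tokens of each unit were recognized as
# one of three labels. With 2 rows and 3 columns there are 2 degrees of
# freedom, so the 5% critical value is 5.991.
CRITICAL_DF2_5PCT = 5.991
similar_pair = is_confusable([40, 35, 25], [38, 37, 25], CRITICAL_DF2_5PCT)
distinct_pair = is_confusable([90, 5, 5], [10, 80, 10], CRITICAL_DF2_5PCT)
```

Under this assumed rule, the first pair would be a candidate for merging into a single unit, while the second pair would stay separate. A real system would need counts from actual recognition runs and a significance level chosen for the task.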
Related papers
Using Chi-Square Testing in Modeling Confusion Characteristics for Robust Phonetic Set Generation
A phonetic representation of a language is used to describe the corresponding pronunciation and synthesize the acoustic model of any vocabulary. To obtain a better phonetic representation, context-dependent units are used to model co-articulation effects between phones and have been broadly used in speech recognition. However, this representation generally increases the number of recognition ...
Using Robust Decision Tree Construction for Continuous Speech Recognition
Context-dependent units built with decision trees have been broadly used to model co-articulation effects and speech variation in speech recognition. Decision trees are generally constructed in a data-driven way and guided by linguistic information that contains a priori phonetic knowledge. In this paper, a two-stage splitting criterion is proposed to effectively construct the decision trees....
Robust tests for testing the parameters of a normal population
This article aims to provide a simple robust method to test the parameters of a normal population by using the new diagnostic tool called the “Forward Search” (FS) method. The most commonly used procedures to test the mean and variance of a normal distribution are Student’s t test and Chi-square test, respectively. These tests suffer from the presence of outliers. We introduce the FS version of...
Irrelevant variability normalization in learning HMM state tying from data based on phonetic decision-tree
We propose to apply the concept of irrelevant variability normalization to the general problem of learning structure from data. Because of the problems of a diversified training data set and/or possible acoustic mismatches between training and testing conditions, the structure learned from the training data by using a maximum likelihood training method will not necessarily generalize well on...
Journal title:
- Speech Communication
Volume 38, Issue
Pages -
Publication date 2002